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Living cells use readout molecules to record the state of receptor proteins, similar to measurements or copies in 
typical computational devices. But is this analogy rigorous? Can cells be optimally efficient, and if not, why? 
We show that, as in computation, a canonical biochemical readout network generates correlations; extracting 
no work from these correlations sets a lower bound on dissipation. For general input, the biochemical network 
cannot reach this bound, even with arbitrarily slow reactions or weak thermodynamic driving. It faces 
an accuracy-dissipation trade-off that is qualitatively distinct from and worse than implied by the bound, 
and more complex steady-state copy processes cannot perform better. Nonetheless, the cost remains close 
to the thermodynamic bound unless accuracy is extremely high. Additionally, we show that biomolecular 
reactions could be used in thermodynamically optimal devices under exogenous manipulation of chemical 
fuels, suggesting an experimental system for testing computational thermodynamics. 
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If it were possible to perform many measurements us¬ 
ing a single bit of memory without putting in work. 
Maxwell’s Demon could use the information gained to 
violate the second law of thermodynamics and extract 
net work from an equilibrium system. Landauer’s insight 
that computational processes require a physical instanti¬ 
ation and therefore have thermodynamic consequenceJi^ 
is key to exorcising the Demon, and the survival of the 
second law has been demonstrated in a range of physi¬ 
cal model^^Hll unlike Maxwell’s thought experiment, 
the correlations generated by a measurement or copy 
are not used to perform work, the cycle increases the 
entropy of the universe (by at least fc In 2 if the mea¬ 
surement is binary, perfectly accurate and has a 50/50 
outcome(PI^. Landauer and others have provided specific 
physical implementations of binary devices along with 
protocols that ach ieve the thermodynamic boun d f or 
measurement cycle^^ or memory e rasur e protocolJiJ^lt^. 
Examples include magnetic systemJil^, or single parti¬ 
cles in piston^^^sHm 

Do biomolecules perform measurement and copying 
within this computational paradigm? Many biological 
processes invol ve crea ting long-lived molecular copies of 
other molecule^^IIillll, Perhaps the most tantalising anal¬ 
ogy is in the cellular sensing of external ligand concentra¬ 
tions. Following the seminal work of Berg and PurcelP^, 
it has been shown that cells can reduce their sensing error 
by averaging a noisy receptor signal over tim (j^^''^^ l Re¬ 
cent studies claim that cells implement time integration 
by dissipatively copying receptor states into the chemi¬ 
cal modification states of readout molecule^^HM] other 
authors have highlighted the necessity of dissipation in 
adaptiorf2^^^ and kinetic proofreading^. 

While it has been noted that there is a connection be¬ 
tween the dissipation present in cellular copying and the 
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thermodynamics of computatiorP^^t^, the nature of the 
connection remains nebulous. How do cellular protocols 
compare to the canonical copy protocols typically consid¬ 
ered in the computational literature? Can cellular sys¬ 
tems reach the fundamental thermodynamic limit on the 
accuracy and energetic cost of a measurement? If not, 
what is the underlying reason for the additional dissipa¬ 
tion? Is it due to the nature of the biomolecular reac¬ 
tions, or due to the design of the signalling network? 
And if cellular systems cannot reach the fundamental 
limit, how does the trade-off between energy and pre¬ 
cision differ from the ideal case? To answer these ques¬ 
tions, it is necessary to construct a rigorous mapping be¬ 
tween cellular processes and computational copying. Un¬ 
derstanding the connection between the thermodynam¬ 
ics of co mputation and the thermodynamics of biological 
processegSSESEZEOHSSI a mechanistic level would enable 
translating quantitative, not just qualitative, results from 
the literature on computation. 

Here, we map a canonical push-pull signalling motif 
to a computational copy device at the level of the master 
equation. The uncontrolled loss of correlations generated 
between receptors and readouts (rather than the widely 
discussed costs of “erasure”) sets a lower bound on the 
dissipation. We show that the push-pull network cannot 
converge on this fundamental limit for a general input sig¬ 
nal, regardless of its parameters, and we prove that more 
complex biochemical networks involving multiple steps or 
parallel pathways cannot perform any better. Remark¬ 
ably, however, biochemical networks can operate close to 
the lower bound at moderate-to-high accuracy, even at 
a high rate of copying. Further, cellular networks are 
naturally adaptive, dissipating less when the sampling 
challenge is reduced. Finally we show that an artificial 
copy device based on biochemical reactions can achieve 
the thermodynamic bound with exogenous manipulation 
of fuel concentrations, providing an alternative platform 
for investigating the thermodynamics of computation. 
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I. RESULTS 

Mapping a cellular biochemical network to a copy 
process. We consider bi-functional kinase systems, 
which are common in bacterigP^. The bi-functional ki¬ 
nase either tends to phosphorylate or dephosphorylate a 
readout x, depending on the ligand-binding state of the 
receptor to which the kinase is coupled (Fig. [^. If the 
receptor R is bound to ligand L, then the bi-functional 
kinase acts as a kinase, catalyzing phosphorylation and 
dephosphorylation reactions that are both coupled to 
ATP hydrolysis. If the receptor is not bound to ligand, 
then the bi-functional kinase acts as a phosphatase, cat¬ 
alyzing phosphorylation and dephosphorylation reactions 
that are uncoupled from ATP hydrolysis. Such a system 
can be described by the following reactions 

R + L^RL, 

RL -I- a; -I- ATP^i?L + x* + ADP, (1) 

R + x*^R -I- a; -I- P, 

where the kinase/phosphatase activity is coarse-grained 
into the ligand-binding state of the receptor. Here x and 
X* represent unphosphorylated and phosphorylated read¬ 
out states, respectively. In this manuscript we will use R, 
RL, X and x* to represent both molecular species and the 
number of molecules of that type, depending on context. 

For simplicity, we take [L\ to be constant, and assume 
the system is maintained in a non-equilibrium steady 
state: [ADP], [ATP] and [P] are fixed. We treat phospho¬ 
rylation and dephosphorylation as instantaneous second- 
order reactions. Thus 

R ^ RL, RL + X ^ RL + X* , R + X* ^ R + X. (2) 

k2 ki fes 

The master equation for this system, and the chemical 
kinetics approximation, are provided in Supplementary 
Discussion 1. We emphasize that the reactions within 
each equation are the microscopic reverses of each other, 
while the reactions of the second and third equations cor¬ 
respond to distinct reaction paths. This yields a thermo¬ 
dynamically consistent model. 

Qualitatively, this circuit performs copies (or measure¬ 
ments) in the following way. As spontaneous phospho¬ 
rylation and dephosphorylation in the absence of the 
bi-functional kinase occur at a low rate which we take 
to be zero, the two chemical modification states of the 
readout are analogous to two stable states of a mem¬ 
ory bit separated by a large bar rier, a s widely considered 
in the computational literatur c ! ^F | i4| There are, how¬ 
ever, two separate paths between the two wells, via ex¬ 
change of phosphate with ATP and via exchange of phos¬ 
phate with the cytosol. Moreover, the ATP-independent 
(de)phosphorylation reaction has a high yield of x in 
equilibrium, whereas the ATP-coupled reaction has an 
intrinsically high yield of x*. The resultant extended 
free-energy landscape for a single readout that explic¬ 
itly considers ATP turnover is illustrated in Fig.j^B. The 



FIG. 1. A canonical signaling network. (A) The signaling 
network utilizes a receptor that acts as a bifunctional ki¬ 
nase/phosphatase: when bound to ligand, it catalyses the 
activation of the downstream readout; when unbound, it cat¬ 
alyzes the deactivation of the downstream readout. Free- 
energy dissipation due to the use of fuel, coarse-grained from 
this representation, drives the reactions. (B) Schematic free- 
energy landscape of a single readout molecule in the bio¬ 
chemical network. We plot the free energy G as a func¬ 
tion of the number of ATP molecules that are converted into 
ADP molecules, for the two states x and x*. Thermody¬ 
namically favourable transitions are shown with solid arrows, 
and unfavourable transitions with dashed arrows. The pres¬ 
ence of catalysts R and RL makes these transitions faster, 
and thereby push the system towards x or x*, but the overall 
thermodynamic drive is fixed for both reactions. 


presence of RL lowers the barrier between pairs of states 
connected by ATP turnover, whereas the presence of R 
lowers the barrier between states not connected by ATP 
turnover. In this way, the receptor will, depending on its 
ligand-binding state, push readouts towards either x or 
X*, which can then be thought of as copying the state of 
the rec eptor into the chemical modification state of the 
readoutpS^. We will now make this analogy concrete, 
allowing a quantitative analysis of biochemical copying. 

Since each readout molecule provides a stable mem¬ 
ory of the receptor state, the readout molecules collec¬ 
tively encode the receptor history. This enables time 
integration of the receptor signal a nd henc e enhanced 
accuracy of concentration estimates are not 

concerned, however , with the p recision of sensing a con¬ 
stant concentratioi i^^ l ^'^ I ^^ I ^^Ft) nor with the “learning 
rate” between the ligand concentration and the network, 
which is important when changes in external concentra¬ 
tions are more rapicP^®^. Rather, we are interested in 
whether the readout reaction can be rigorously described 
in terms of a copy process, and if so, how this cellular pro¬ 
tocol compares to optimal quasistatic computational pro¬ 
tocols involving the manipulatio n of e nergy landscapes 
that are common in the literatur^^llllll. 

The biochemical network as a copy process. Let us 

consider the dynamics necessary for the biochemical net¬ 
work to truly perform stochastic copying. Each readout 
molecule should perform a copy of any one ligand-bound 
receptor at a rate fc^pyi O’^e unbound receptor 
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at a rate fc^py These copies are performed with accura¬ 
cies Sul and sr respectively, and the result of the copy 
is independent of the prior state of the readout. Thus a 
copy of RL (R) returns x* (x) with probability srl (sr)- 
At the level of the master equation, old copies in state x 
should be converted into new copies in state x* at a rate 

= (NRLk^J^ySRL + - sr))x, (3) 

in which Nr and Nrl are numbers of receptors in each 
state, and x* should be converted into a; at a rate 

= {NRLk^opyi^ - Srl) + NRk^^pySR)x*. ( 4 ) 

Returning to our actual biochemical network, Eq. 
specifies the dynamics of x and x*. The transitions occur 
with rates 

= (kslRL] + k5[R])x, 

( 5 ) 

= (fc 4 [i?L] -I- ke[R])x*. 

Comparing Eqsj3][^, we see that the biochemical network 
is equivalent to a stochastic copy process with 

kcopy = (ks + kA)/V, fc^py = (fcg -f ke)/V, (6) 

in which V is the volume of the system and the copy 
accuracies are 

Srl = k^/iks + ki), sr = fee/(fcs -f fee). (7) 

Copies are therefore made at a rate {k^J^^NRL -f 
^mpy^R)^T, which gives an average copy rate 

hcopy = xt {{ks + k4){[RL]) + (fcg -I- fc5)([7?])), (8) 

in which xt is the total number of readouts. Thus the 
biochemical network can be directly mapped to a stochas¬ 
tic copying process. 

Energetic cost per copy cycle. In the non¬ 
equilibrium steady state, the average dissipation rate of 
chemical free energy (rCchem) is 

, ( 9 ) 


d’chem — f^flux(A/ii -f A^2) — RfiuxkT ln 


k3ke \ 
kikz ) 


Furthermore, the fraction of phosphorylated readout / is 


. ^ fcsPT ^ 5(1 -P) 

(fca -I- k4)p + (fcs -I- fc 6 )(l - p) ’ 


( 12 ) 


giving 

_ (fcsfce - k4k3)p{l - p)[xt]Rt f k^\ 

(13) 

For the full calculation, we refer to Supplementary Dis¬ 
cussion 1. The mean held approach holds in the limit of 
many receptors, or when the receptor dynamics is fast. If 
these conditions are not met, readouts perform copies of 
receptor states that are correlated with the current read¬ 
out state. Detailed analysis of this regime is beyond the 
scope of this work, but a brief discussion is provided in 
Supplementary Discussion 1. 

Given the rate of copying (Eq.[^ and the rate at which 
chemical work is done (Eq. |13[ ), we are now in a position 
to calculate the chemical work done per copy cycle: 

Wchem _ jkske - k4k5)p{l - p) 

’^copy ((^3 + ^4)P + (^5 + ^e))! ~ P)) \k 4 k 3 ) 

( 14 ) 

This result can be simplihed by noting that the proba¬ 
bility a copy is made of RL is not p, but rather 

= _ f^3 + k4 _^ , 

(^3 + k4)p + (fcs + fc6)(l — p) 


Indeed, if ^3 -I- ^4 > k^ + ke, a ligand-bound recep¬ 
tor is more frequently copied than an unbound receptor 
molecule. This expression allows us to rewrite Eq.[^ as 

= {sr + SRL - l)p'{l - p')kT In f • (16) 

^copy J 

The quantity kT In has dimensions of energy and is 
equal to the chemical work done during a single phos¬ 
phorylation/dephosphorylation cycle. We can split it 
into an energy related to the accuracy of copying RL, 
Es„j^ = kT\n{k 3 /k 4 ) = kTln{sRL/{l - srl)), and an 
energy related to the accuracy of copying R, = 

kT\n{k 3 /k 3 ) = kTln{sR/{l — sr)). In these terms. 


where hflux is the average current of readout molecules 
around the phosphorylation/dephospohorylation loop; 
Api = padp — Matp and Ap2 = Lp- Tlia sum 
— (Api -I- Ap2) is therefore the total free energy dissi¬ 
pated when a readout goes around this cycle once. 

To proceed, we introduce the following averages: 


{RL) _ fci[L] (M) 

Rt k2 + fci [L] ’ [xt] 


( 10 ) 


with Rt the total number of receptors. In the mean-field 
limit, hflux follows from Eq. |^as 


^chem 

^copy 


{sr + SRL - l)p'(l - p'){Eg^ + Eg,,). (17) 


Inverting the sign of Eg, and Eg,, corresponds to a 
mirror-image encoding, in which RL is copied to x and 
i? to a;*. Low accuracy corresponds to Eg,, Eg,, -A 0, 
whensfl « l{l+Eg,/2kT), and srl « ^{l+Eg,,/2kT). 
In the symmetric case of Eg, = Eg,, = —A/i/2, the ac¬ 
curacy of copies is sr = srl = 5(1 ~ Ap/dkT). A sim¬ 
ilar analysis, with an equivalent outcome, is performed 
in Supplementary Discussion 2 for a related system in 
which receptors act only as kinases. 


Tiflux = {k3RL[x]-k4RL[x*]) = {k3{l-f)-k4f)p[xT]RT- 

( 11 ) 


Optimal devices and the thermodynamic bound. 

The biochemical network has the dynamics of a process in 
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which readouts randomly perform copies of receptors. It 
is also fundamentally dissipative - does the fact that it is 
a copy process set a practically relevant thermodynamic 
bound on this dissipation? 

We can calculate the minimal thermodynamic cost of 
the random computational copy process that is equiv¬ 
alent to our network motif. In each copy operation, a 
memory bit (in a state x* with probability /{srl, sh,p')) 
is exposed to data (in state RL with probability p') - the 
final result is a memory bit in state x* with probability 
srl or I — conditional on the data. Mutual infor¬ 
mation I is then generated between the data and the 
memory. 

After the copy step, the data (receptor) and memory 
(readout) are decoupled without loss of information. This 
means that they remain correlated, even though there 
is no direct physical interaction between the data and 
memory anymore. The fact that they remain correlated 
in the absence of direct interactions implies that the sys¬ 
tem is out of equili brium: the free energy stored in the 
correlations is Making computational copies 

thus amounts to storing free energy in mutual informa¬ 
tion, and if this information is not used to extract work 
(as done by an efficient Maxwell Demon) but simply lost 
in an uncontrolled fashion during a subsequent measure¬ 
ment, then the process is irreversible and the information 
lost sets a lower bound on dissipation. In the random 
copy process, the stored information is overwritten with¬ 
out being used to extract work, and hence kTI sets a 
lower bound on the dissipation for the entire cyclic copy 
operation, as noted in Ref. HI A copying device and a 
set of quasistatic protocols that can achieve this bound 
for various input data are given in Supplementary Fig. 1 
and analysed in Supplementary Discussion 3. 

The biochemical network considered here has the dy¬ 
namics of a random copy process, and, indeed, work is 
not extracted from the information generated. Thus its 
dissipation is constrained by the mutual information gen¬ 
erated in the stochastic copy process: 

>/=/si?Lln(^) +p'(l-SflL)ln(i^) 

+ (1-P')sflln +(l-p')((l-Sfl)ln(^i^j . 

(18) 

This bound implies an efficiency ij = (kTI)/{w/Ucopy) < 
1 , which is unity for an optimal protocol. 

The readout molecule states are persistent - the ligand¬ 
binding state of the receptor is rememb ered e ven after 
detaching, thus enabling time integratiorP32ll. Systems 
that do not make persistent copies, including passive 
readout^^, the receptors themselves and also some more 
complicated network^^, are not bound by the same limit. 
Previous work has linked energy diss ipatio n in similar 
systems to the erasure of the memorji^lIlSI^ that is the 
resetting of the memory bit to a well-defined state. As 
shown by Landauer, this erasure does lead to the transfer 
of heat from the system to the surroundings. Resetting 
a bit, however, is not intrinsically thermodynamically 


irreversibl^^EHniiniSIl _ indeed, Landauer’s calculation of 
the minimal heat transfer applies to a thermodynami¬ 
cally reversible erasure step. It is therefore the uncon¬ 
trolled loss of correlations (the failure to extract work 
from them) rather than erasure itself, which is the origin 
of the thermodynamic irreversibility of a copy cycle. It 
is also this failure that sets the fundamental thermody¬ 
namic lower bound on the work to perform copy cycles. 
In fact, erasure is not even necessary in an optimal copy 
cycle, which leaves open the possibility that the biochem¬ 
ical network, which actually contains no explicit erasure, 
could achieve the lower bound. For more details, see 
Supplementary Discussion 3. 


Trade-off between dissipation and error. The 


bound in Eq. 18 holds for all choices of srl, sr and p'. 


But how close does the biochemical network come to this 
bound, and can it be reached in certain conditions? Our 
concrete mapping allows us to examine these questions. 

Initially, we consider the simplest case of equal accu¬ 
racy {sr = Srl = s). The dissipation per copy cycle for 
the biochemical network (Eq. 171 then reduces to 


‘^chem 

^CODV 


= 2Es{2s-l)p'{l-p'), 


(19) 


with Eg = Egj^j^ = Eg^. The chemical work per copy 
cycle, rCchem/?T-copy, IS plotted against p' in Eig. ^A-B for 
two values of s (red dashed lines). We also indicate be¬ 
haviour forbidden by the thermodynamic bound of Eq. |18| 
(grey region). In Fig. [^(C), we fix p' = 0.5 and plot the 
cost per copy cycle against s for the biochemical network 
(red dashed line) and an optimal system that saturates 
the bound (solid blue line). 

Both the cost of biochemcial network and the ther¬ 
modynamic bound drop as the required accuracy is de¬ 
creased. The irreversible loss of information bounds the 
work, and lower accuracy implies less information to be 
lost. It is clear from Fig. [^(C), however, that the two 
systems have a very different tradeoff between dissipa¬ 
tion and accuracy. The required free-energy input for 
the biochemical case diverges as s —?> 1 (wchem/’T'copy ~ 
—2p'(l — p')fcTln(l — s)), whereas the dissipated work 
remains finite in an optimal system. It is clear that a 
cell will have to sacrifice some accuracy for the sake of 
efficiency. Even in the limit of low accuracy (s —>■ |) and 
at p' = 0.5, the biochemical network is twice as costly as 
the bound (??—>■ | from above). Expanding Eqs.! 

2 ^chem/^copy 


18 


19 


for 


and 


4fcr(s — 2 )^ for the 


biochemical network and u>chem/?^copy ~ 2kT{s — |)^ for 
the optimal device and protocol. 

The fundamental difference between the biochemical 
network and an optimal protocol is illustrated in Eig.|^ 
An optimal protocol requires reversible quasistatic ma¬ 
nipulation of energy levels. In the biochemical network, 
however, the energy levels are fixed. Only the barrier 
heights and hence the reaction rates are modulated, due 
to the presence or R or RL. As a consequence, biasing 
transitions is more costly in the biochemical system. 
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FIG. 2. Trade-off between dissipation and accuracy in copying. The (chemical) work per copy cycle for different probabilities 
p' of attempting to copy RL is plotted at two different values of the measurement accuracy: (A) s = 0.80; and (B) s = 0.99. 
Note that p' = p, the probability that a receptor is in state RL, if fca -|- fc 4 = k^-\- k% when the sampling rate for the two receptor 
states is the same. The biochemical implementation (red line) does not achieve the lower bound for a measuring device, which 
is the border of the shaded region. The blue lines correspond to quasistatic protocols for the device in Supplementary Fig. 1 
(see Supplementary Discussion 3) and are optimal for a specific value of p', p'*, at the given values of s. (C) Dissipation per 
copy cycle at p' = 0.5 as a function of s for the biochemical network (dashed red line) and a system that saturates the bound 
(solid blue line). The solid blue line [p'* = 0.5) saturates at fcnn2 for perfect accuracy (s = 1) and the cost of the canonical 
biochemical motif diverges. 


This difference is particularly intuitive in the limits of 
low and high copy accuracy for p' = 0.5. For the bio¬ 
chemical network, the difference between the fraction of 
correct copies s, and the fraction of incorrect copies 1 —s, 
is 2(s — 1). For p' = 0.5, in only half of these cases does 
a reaction occur and so the net number of reactions in 
the intended direction is s — | per copy cycle. Each of 
these net reactions has a cost given by the driving free 
energy. Eg « 4(s — \)kT, giving an overall cost per copy 
cycle of Wchem/''T'copy ~ AkT{s— i)^. For an optimal qua¬ 
sistatic protocol, a similar argument can be constructed 
(see Supplementary Discussion 3), but the average energy 
offset between the states at which a transition occurs is 
less than the final offset required by the accuracy. Eg. In 
the limit of Eg —)■ 0, we obtain an average cost of Egj2, 
rather than Eg as for the biochemical network. 

In the limit of high accuracy, s —> 1 and Eg ^ oo, 
a similar analysis shows why the biochemical network is 
much less efficient than an optimal quasistatic protocol. 
The cost of each transition continues to grow linearly 
with Eg in the biochemical network, explaining the di¬ 
vergence of chemical work with s. In an optimal pro¬ 
tocol, however, the work saturates when Eg ^ kT. In 
this case, the energy difference between the two states is 
raised quasistatically to Eg. Therefore at each moment 
in time, the bit is in equilibrium. As E is raised, the 
probability that the bit is in the high energy state de¬ 
creases, until it rapidly becomes negligible when E > kT 
and the higher energy state can be raised further without 
any additional cost. 

As is evident from Figs.[^(A,B), the cost of both an 
optimal system and the biochemical network decrease as 
we move away from p' = 0.5 at fixed s. Fixed accuracy 
measurement results in less information if the data itself 
has low entropy, explaining the reduction in the bound. 
For the biochemical network, if the readout is exposed 
more often to one receptor state, it is more likely to be 
in the appropriate output state prior to a copy. Hence, 


fewer transitions are needed and less dissipation occurs. 
For an alternative network in which receptors only func¬ 
tion as kinases, this automatic compensation only occurs 
at low p (see Supplementary Discussion 2). 

In Supplementary Fig. 2 we plot the efficiency rj as 
a function of p' ^ 0.5 and s = sr = srl', V is actually 
highest for p' —>■ 0 or 1 (when f] ^ from below for all 
s), and lowest at p' = i. This intrinsic adaption is fun¬ 
damentally different from the behaviour of typical com¬ 
putational devices such as that in Supplementary Fig. 1. 
For these devices the protocol must be adjusted as p' is 
varied, via a parameter related to decorrelation, to make 
the system optima (see Supplementary Discussion 3 for 
further details). Figs.[^(A,B) show, with blue lines, the 
work per copy of three fixed protocols for the device in 
Supplementary Fig. I. These protocols are optimal for 
p'* = 0.3, 0.5 and 0.7, respectively, and the lines are tan¬ 
gent to the forbidden region at these values of p', but 
above it elsewhere. It is not clear whether an alternative 
protocol exists that reaches the bound for all p' without 
any adjustment of a parameter, though it seems unlikely. 

None of the “optimal” protocols for the device in Sup¬ 
plementary Fig. 1 perform better than the biochemical 
network for all values of p'. Given knowledge of p', a sys¬ 
tem using quasistatic manipulations could reach the lim¬ 
iting efficiency of a copying device that does not extract 
work from the correlations it generates. In our cellular 
context, however, p' is the quantity that the system is 
trying to measure (rather than the state of individual re¬ 
ceptors given a known p'). The best that could be done, 
therefore, would be to pick a particular protocol, and 
update it as more information became available. Clearly, 
however, implementing this behaviour in an autonomous 
device would require substantial additional complexity. 

We have shown that the biochemical network cannot 
reach the fundamental limit of efficiency for sr = srp, 
regardless of the system parameters. Neither reducing 
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FIG. 3. Schematic comparison between the (free-) energy 
landscapes of the biochemical network (A) and a thermody¬ 
namically optimal protocol (B). In (A), a ladder of states 
exists, with successive rungs related by the turnover of a sin¬ 
gle ATP (implicit in this figure, explicit in Fig.[^. In the 
absence of an enzyme, all transitions are slow (indicated by 
the dashed free-energy barrier connecting the states). In the 
presence of R, half of the transitions are catalysed and can 
occur rapidly (solid lines); when exposed to RL, the alterna¬ 
tive transitions are rapid. The heights of the rungs, however, 
are fixed, meaning that all transitions involve a hxed amount 
of chemical work equal to the offset. By contrast, for the op¬ 
timal protocol in (B), switching between states is driven by 
slowly destabilizing one state with respect to the other, so 
that the majority of transitions have already occured before 
the offset approaches its limiting value. 


the reaction rates nor the thermodynamic driving per¬ 
mits 77 > 1/2. It is sometimes assumed that systems un¬ 
der steady-state forcing are effectively quasistatic or re¬ 
versible in the limit of this forcing being wealP^- clearly 
that is not the case here. For sji 7 ^ sjil, it is possi¬ 
ble to obtain an efficiency ry > 1 / 2 , and rj can even ap¬ 
proach unity for extreme values of p', sr and srl (see 
Supplementary Fig. 3 and Supplementary Discussion 4). 
Nonetheless, it remains true that ry cannot converge on 
unity for general p', regardless of how sr and srl are var¬ 
ied. The biochemical network is more dissipative than an 
optimal process, because operating at a constant thermo¬ 
dynamic driving force leads to more energetically expen¬ 
sive transitions than slowly manipulating energy levels. 

Thus far we have emphasized differences between the 
thermodynamic bound and the cost of the biochemical 
protocol. It is remarkable, however, that the biochemical 
network comes so close to the lower bound, even at fairly 
high accuracies. Reaching 99% accuracy for p' = 0.5 re¬ 
quires less than four times the dissipation of the lower 
bound, and rj is even higher for p' 7 ^ 0.5. Further, this 
efficiency can be achieved at an arbitrarily high rate of 
copying - the absolute rates do not enter the expression 




FIG. 4. Copy processes of increasing complexity. (A) A 
simple one-step process in which the i?L-bound state is as¬ 
sumed short-lived. (B) A tightly-coupled process in which 
the readout passes through multiple states whilst bound to 
the receptor, but only one pathway is possible. (C) A more 
general process in which multiple pathways between x and x* 
exist. Each different path could be associated with different 
entropy increases in the environment. 


in Eq.[^ Through our quantitative mapping, we have 
shown that a physically reasonable model system oper¬ 
ating autonomously at an arbitrary rate and with a high 
copying accuracy comes close to the fundamental ther¬ 
modynamic limit on the cost of a copy process. 

One-step copy processes are maximally efficient 
for the biochemical network. The push-pull network 
considered so far is a one-step copy process, with the 
conversion between the phosphorylation states occurring 
via a single instantaneous transition. We now show that 
more complex processes, involving multiple steps or path¬ 
ways, cannot improve the trade-off between energy and 
precision in autonomous, steady-state systems driven di¬ 
rectly by an out-of equilibrium chemical fuel. We again 
consider Markov process with discrete states, but we now 
allow for multiple states and multiple parallel pathways 
between x and x* (see Fig.|^. We explicitly consider 
the interconversion of x and x* by RL; an equivalent 
argument holds for reactions mediated by R. The au¬ 
tonomous requirement prohibits external control and im¬ 
plies that transition rates are fixed. 

In the simple one-step process, the accuracy of copying 
RL is Srl = k^Kk^+ki). In a more general process, how¬ 
ever, transitions between x and x* are not instantaneous 
and hence cannot be described with rate constants. The 
natural generalisation is to consider the flux from x to 
X* and vice versa, which is defined as the rate at which 
trajectories leave x and subsequently reach x* before re¬ 
turning to x^^ The fluxes determine the copy accuracy 
in the same way as the rate constants for the one-step 
process, since they represent the rate receptors initiate 
correct and incorrect transitionsbetween x and x*. 

Consider the accuracy of copying RL, srl = 4'3/{4>3 + 
(f’i) = l/(l + /' 4 //' 3 )) which depends solely on (/> 3 /(/> 4 . We 
define the set of paths S that start by leaving x and reach 
X* via an interaction with RL without returning to x. 
The entropy change in the environment AS'env[-z(f)] due 
to each of these individual trajectories z{t) is related to 
the probability of observing the forward pathway, and its 
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reverse 5 (t)®Hl 6 l 

A^env [z{t)] = fcIn a;*]) " 

Here AS'int is an intrinsic entropy difference between 
macrostates x and x* that arises because discrete bio¬ 
chemical states contain multiple microstates. It is a prop¬ 
erty of the states x and x*, rather than the transitions, 
and will eventually cancel out in our analysis. We can 
then divide S into subsets Si according to AS'env[-z(0]; 
all paths within Si generate the same entropy in the en¬ 
vironment, ASl^^. Conceptually, different subsets might 
correspond to pathways that consume different amounts 
of ATP. A simple one-step pathway, as considered hith¬ 
erto, is a special case of a system with a single Si. 

The probability of observing any pathway within Si 
given an initial state z = a; is 

P(5,|a;) = piz{t)\z{0) = x). ( 21 ) 

Similarly, P{S-^\x*) = = ^*) 

probability of observing a reverse of a pathway in Si 
given an initial state x*. By design, + AS'int = 

k\n{P{Si\x)/P{S-i\x*)). Further, the ratio of forwards 
to backwards fluxes is given by 

h ^ ’EjPjS^lx) 

-^4 j:.p{s-.\x*y ^ > 

An immediate consequence is that in systems with only 
one subset of S, in which all ASenv = associated 

with a transition are the same, the ratio of forwards to 
backwards fluxes is fixed by 

fc In ^^ = AS«f ‘ + ASint. (23) 


Such a process is “tightly coupled”; in our framework, 
being tightly coupled implies that each transition x to 
X* is associated with the breakdown of the same number 
of ATP molecules. Regardless of the number of inter¬ 
mediate steps or possible pathways in a tightly-coupled 
process between x and x*, the copy accuracy is unam¬ 
biguously determined by the entropy change in the en¬ 
vironment. Thus all tightly-coupled process with the 
same accuracy are associated with the same dissipation. 
Since the one-step process we have considered hitherto 
is tightly-coupled, all tightly-coupled processes have the 
same accuracy/efficiency trade-off as we have derived. 

It is also possible to consider processes that are not 
tightly-coupled, with multiple subsets Si. In this case, 
the average entropy deposition per x ^ x* transition is 
given by 


A5, 


multi 

env 


\p^{S.,\x*) 


( 24 ) 


where we used the “m” superscript to denote probabili¬ 
ties for this specific multi-subset syste m. The accuracy of 


22 


4>T'^y4>] 


multi 

’4 


this process is still determined by Eq.; 
yy^P'^{Si\x)/P'^{S-i\x*). We can compare this multi¬ 
subset process to a tightly-coupled process with the same 
accuracy: The key point is 

that since entropy generation and accuracy are unam¬ 
biguously related for a tightly-coupled process (Eq. 231, 
any tightly-coupled process between x and x* with the 
same accuracy as the multi-subset generates 

^multi 

A^‘ifv“ + A5i„t = = fc In = fc In 


/.multi 
C>4 


T.^P’^{S-^\x*y 

(25) 

Combining Eqns.[^ and thus yields 

A^rr^ - = E 

(26) 

where the subscript N indicates normalization: 
P^(5,|a:) = P'^{S,\x)/Y.jP'"{Syx), and PY(‘5-iY*) 
is defined equivalently. Eq.[^ is a Kullback-Leibler 
divergence between the families of paths taken by 
forwards and backwards transitions in the multi-subset 
process, and is therefore necessarily non-negative. Thus 

Asyy^^^ - A^Y®v“ > 0. 


(27) 


Similarly, paths that begin and end in x or x* deposit 
no entropy for a tightly coupled process, whereas the en¬ 
tropy deposition of these paths is non-negative in gen¬ 
eral. For a system operating in steady state, increased 
entropy deposited into the environment implies less ef¬ 
ficiency. Therefore no process of a given copy accuracy 
is more efficient than a tightly-coupled one. The limits 
derived for a one-step process, a special case of a tightly- 
coupled process are therefore general. 

Our derivation is related to the proof that the esti¬ 
mate of dissipation obtained from the irreversibility of a 
coarse-grained traj ectory gives a lower bound on the true 
entropy generation. E’^F® ! The results are fundamentally 
distinct, however. We could consider coarse-graining a 
complex copy process so that all states were now x or a;*. 
However, the result would not be a simple one-step pro¬ 
cess with the same accuracy and lower dissipation; the 
dynamics would be non-Markovian and reflect the un¬ 
derlying complex process. To state that a true one-step 
process could reproduce the accuracy at the same cost as 
estimated from the coarse-grained description would be 
to assume the conjecture that is to be proven. 


Biochemical implementation of an optimal de¬ 
vice and protocol. We have argued that no cellu¬ 
lar biochemical network, operating autonomously and 
directly powered by a non-equilibrium fuel supply, can 
reach the thermodynamic bound on efficiency for gen¬ 
eral input data. We now consider whether this is a fun¬ 
damental property of biochemical reactions, or whether 
biomolecules could in principle act as thermodynamically 
optimal bits. 













There are two principal differences between cellular A 
biochemical network and optimal system s such as that 
in Fig. 1^ and Supplementary Fig. lEMHI. Firstly, cellu¬ 
lar networks operates continuously, rather than taking a 
series of discrete measurements with external clocking. 
Secondly, as emphasized above, they involve no manipu¬ 
lation of energy levels. 

Concerning the first difference, in Supplementary Dis¬ 
cussion 5 we show that a clocked analogue of the cellular 
push-pull motif (illustrated in Supplementary Fig. 4) 
gives a work per copy identical to the continuous case. 

This is because, despite being operated in a clocked fash¬ 
ion, the device still out of equilibrium and functions at 
constant chemical potential of fuel molecules. The fact 
that cellular networks operate in a stochastic continuous 
manner rather than a clock-like fashion is not the funda¬ 
mental reason why they cannot reach the bound on the 
energy cost of a copy operation. 

Functioning out-of-equilibrium is necessary for a de¬ 
vice operating at constant chemical potential of fuel; if 
the reactions were in equilibrium, the receptor (which is 
a catalyst) could not influence the yield of xjx*. We now 
show that a system driven by quasistaic manipulation of 
ATP, ADP and P concentrations could reach the ther¬ 
modynamic bound. To operate in the quasistatic limit, 
i?L and i? must be long-lived; in practice, constitutively 
active kinases and phosphatases could be used. 

We consider a device and measurement cycle as il¬ 
lustrated schematically in Fig. The key ingredients 
are the possibility of manipulating the concentrations of 
ATP, ADP and P in the vicinity of the readout, and the 
ability to bring the readout into or out of close proximity 
with receptors. We consider the same receptor/readout 
reactions as in Eq. and now define the free energy 
changes of reaction AGp for the phosphorylation of x 
by RL and AG^ for the dephosphorylation of x* by R. 

These quantities are given by 

AGp = fj-ADP — Matp + AGj,/^;., , . 

AGd = a^p-AG,/,., 

in which quantifies the intrinsic stability of x 

and X* (assumed to be independent of the chemical po¬ 
tentials of ATP, ADP and The reactions can thus 

be manipulated by controlling /j,atp,Madp and /j,p, for 
example by coupling of the system to a series of reser¬ 
voirs (Fig. ^(B)). We outline an optimal cycle below, and 
calculate the chemical work done on the readout subsys¬ 
tem by the reservoirs (the average number of reactions 
of a given type, multiplied by the associated chemical 
work, integrated over the whole process) in Supplemen¬ 
tary Discussion 6. Throughout, we assume that receptor 
states are stable, and that reactions cannot occur with¬ 
out a catalyst. The eight steps of the cycle are intended 
to be closely analogous to the eight stages of the typi¬ 
cal computational protocol in Supplementary Fig. 1; for 
subtleties involved in this comparison, see Supplemen¬ 
tary Discussion 6. The system contains a readout (the 
memory bit) a receptor of known state R (a reference bit) 


Data bit B 



Reference bit 


FIG. 5. A biochemical implementation of an optimal device 
and copy protocol. The cycle is illustrated in (A), and the 
steps are explained in the text. The system contains two re¬ 
ceptors: one acting as a reference bit in state R for resetting 
and a receptor acting as a data bit in state RL or R. These 
receptors are attached to either end of a reaction volume. 
A readout molecule (memory bit) is tethered to the side of 
the reaction volume. The colour of the reaction volume indi¬ 
cates the chemical potential of fuel molecules in solution: it is 
dark when AGp, AGd are large and negative and white when 
AGp, AGd ~ 0 . (B) A device for implementing this cycle. 
A small reaction volume is coupled to a piston containing a 
series of reservoirs of varying ATP, ADP and P content. Sim¬ 
ilarly, the receptors and readout can be brought in and out of 
proximity by manipulation of a second piston. 

and a receptor in either R or RL (the data bit). 

The readout begins coupled to a buffer with 
AGp, AGd = —AGp {AGr assumed to be large and 
positive). There is no receptor in close proximity, but 
the readout has been reset (equilibrated at the end 
of a previous measurement cycle by a receptor in the 
R state), and therefore is in state x with probability 
1/(1 -I- exp(—AG^/fcT)). In fact, the accuracy of this 
reset does not influence the cost of the cycle (see Supple¬ 
mentary Discussion 6). 

1. The readout is brought into close proximity with a 
receptor of known state R. 

2 . AGp, AGd are slowly (quasistatically) raised from 
—AGp to 0. Steps 1 and 2 allow the state of the 
memory to be reversibly uncorrelated from the ref¬ 
erence receptor, prior to the measurement. 

3. The readout is brought into close proximity with 
a receptor of unknown state (ligand-bound with 
probability p'). AGp, AGd are then slowly lowered 
to —AGs- In this step, the state of the readout 
is set to match that of the receptor with accuracy 
s = 1/(1 -I- exp{—AGs/kT)). 

4. The readout is moved away from the receptor. 


























9 


5. AGp, AGd are set to AGoS- This stage constitutes 
the end of the copy subprocess; if the unknown re¬ 
ceptor is in state RL, then the readout is in x* with 
probability s = 1 /{1 + exp{—AGs/kT)). Similarly, 
if the unknown receptor is in state R, the readout is 
in X with probability s = 1/(1 -I- exp{—AGs/kT)). 

6 . We now decorrelate the memory and data bits at 
a non-zero bias between the two readout states, 
AGp, AGd = AGoff. To reach the fundamen¬ 
tal thermodynamic bound, AGoS must be chosen 
carefully, and depends on p' and s. To decorre¬ 
late, the readout is brought into close proximity 
with a known receptor of state R. The readout re¬ 
laxes to a state reflective of AGoff via the reaction 
R + x*^R -I- a; -I- P. 

7. The readout molecule is reset by quasistatically 
lowering AGp, AGd to —AGr from AGoff, return¬ 
ing it to a state dominated by x. 

8 . The readout is separated from the reference bit, 
returning the system to the initial state. 

The readout and receptor are in the same state at the 
start and finish of the cycle; thus the net chemical work of 
the reservoirs equates to the free energy dissipated by the 
entire system. As shown in Supplementary Discussion 6, 
minimizing dissipation with respect to AGoff at fixed p' 
and accuracy s (fixed by AGs) gives a chemical work 
per copy equal to the mutual information generated by a 
measurement. In the special case p' = 1/2, AGs —t oo, 
■u^chem fcTln2, as expected. A single readout could 
also be made to copy multiple receptors sequentially - 
using, for example, a series of receptors anchored to a 
polymer, as in Supplementary Fig. 4. We note that 
it is also possible to construct an optimal cycle if the 
receptor is only catalytically active in the RL state - see 
Supplementary Fig. 5 and Supplementary Discussion 7. 


II. DISCUSSION 

We have described a canonical cellular readout network 
rigorously in terms of computational copy operations, 
thus demonstrating that the system is indeed bound by 
the thermodynamics of computation. For a general dis¬ 
tribution of input data, the cellular network cannot reach 
this fundamental limit of efficiency. Unlike optimal com¬ 
putational protocols, the thermodynamic driving force 
used to push the memory device between its states is not 
introduced quasistatically. Instead, a continuously op¬ 
erating autonomous network must have a constant ther¬ 
modynamic driving force. Even in the limit where the 
driving force and hence the accuracy become vanishingly 
small, the cellular system is not thermodynamically op¬ 
timal. In the regime of high accuracy, the difference is 
larger: while optimal protocols can reach 100 % accuracy 
for a finite cost of fcrin( 2 ), cellular networks can only 


achieve 100% accuracy for a cost that diverges. Nonethe¬ 
less, achieving 99% accuracy for an unbiased distribution 
of input data requires less than AkT In 2 of dissipation per 
copy cycle, and the relative performance of the biochem¬ 
ical network for biased input data can be even better. 
Although kT sets an energy scale, it is not obvious a pri¬ 
ori that the numerical factors should be so low. For ex¬ 
ample, the recently-derived “thermodynamic uncertainty 
relation” , 1 ^ in which the cost of achieving a relative un¬ 
certainty e in the number of steps of a biomolecular pro¬ 
cess was shown to be at least 2kT /e^ - gives 20000 fcr for 
99% accuracy. 

Not only can this canonical cellular signalling system 
get remarkably close to the fundamental bound for effi¬ 
ciency of copying at relatively high accuracy, it can do 
so at an arbitrarily high absolute copy rate. Further, 
the system is autonomous, and so there is no need to 
consider the intrinsic costs of applying a time-varying 
yet stab le con trol to a bit, as must be done in typical 
protocol j^° * ^^ l Additionally, the canonical biochemical 
network naturally adapts to high or low levels of ligand- 
bound receptors, reducing its dissipation per copy cy¬ 
cle in a way that standard quasistatic protocols cannot 
achieve without feedback. 

We have also explained the underlying reason why our 
mapping implies a minimal thermodynamic dissipation 
per copy for the network. This is not because the mem¬ 
ory is “erased”, but because the correlations generated 
during measurement are not used to extract work. Era¬ 
sure itself is not intrinsically irreversible, and no distinct 
erasure step is present. 

Our analysis has revealed that biomolecules can in 
principle be used to implement protocols that achieve 
the thermodynamic bound, and we have provided an 
example. The key difference from the canonical cel¬ 
lular network is the manipulation of concentrations of 
ATP, ADP and P. If these manipulations are performed 
slowly enough, reactions (except decorrelation of read¬ 
out and receptor) can be performed reversibly as reac¬ 
tants/products are gradually stabilised/destabilised with 
respect to each other. 

In this work we have focussed on a single-step copy 
process. However, we have also argued that more com¬ 
plex copy processes, including multiple bound recep¬ 
tor/readout states or even multiple pathways in which 
different amounts of ATP can be consumed, cannot be 
equally accurate at a lower cost. Indeed, we have shown 
that all “tightly-coupled” processes in which all tran¬ 
sitions consume the same amount of chemical fuel are 
equally efficient, and all others cannot perform better. 
At a physical level, it is intuitive that complex process 
can be less efficient - they naturally allow for dissipative 
cycles. Similarly, longer pathways are not helpful because 
one cannot make an irreversible process less irreversible 
by breaking it into many small steps whilst keeping the 
overall driving force fixed. For processes such as kinetic 
proofreading, however, in which chemical fuel drives reac¬ 
tions out of equilibrium, complex or longer reaction path- 
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ways can allow the same outcome at a lower costPSISIIlSI 
The crucial difference is that in a copy process, the metric 
of accuracy is exactly the ratio of forwards to backwards 
transition probabilities, which is the quantity directly in¬ 
fluenced by fuel consumption. More fuel consumption 
always improves the metric. For other tasks, fuel con¬ 
sumption is necessary, but other factors can also influ¬ 
ence performance. In kinetic proofreading, the relevant 
metric is the rel ative occupancy of a binding site by two 
different ligandsThis quantity is influenced by the 
strength of chemical driving, but it is also limited by the 
number of intermediate states at which there is an op¬ 
portunity to discriminate between ligands. Thus it can 
be beneficial to consider multi-stage proofreading. 

Our analysis of the cellular network used the mean- 
field limit, which becomes accurate when the receptor 
correlation time Tc is shorter than the relaxation time r^. 
of the readout network. Interestingly, this is precisely 
the optimal regime for sensin^^, because the system can 
take multiple /tc > 1 concentration measurements per 
receptor molecule. In this regime, the readout molecules 
do not track the fluctuations in the receptor state, but, 
collectively, average it. As a result, the “learning rate’^S 
between the readout and the receptor is actually zero in 
this limit (see Supplementary Discussion 1). While the 
opposite regime t^/tc < 1 is detrimental for the mecha¬ 
nism of time integration, we do note that the work per 
measurement is less. This is because in this regime the 
measurements become correlated, and taking correlated 
measurements requires less work for a given desired ac¬ 
curacy. A full analysis of this regime, as well as the 
consequences of spontaneous reactions not mediated by 
kinases and phosphatases, is the subject of further work. 

The fact that cells employ thermodynamically in¬ 
efficient out-of-equilibrium circuits highlights the con¬ 
straints under which they function. Cells do not have 
infinite time to perform a measurement, meaning that 
quasistatic manipulations are infeasible. Further, our 
quasistatic protocol requires that a readout molecule ex¬ 
clusively encounters either ligand-bound or ligand-free 
receptors during each copy process. This requires that 
the ligand and ligand-free states of the receptors are sta¬ 
ble on the time scale of the measurement cycle, and also 
that receptors in both states are not accessible to a single 
readout. In reality, the finite lifetime of receptor states 
places limits on the measurement time, and cells typi¬ 
cally have multiple receptor molecules with which any 
one readout molecule can interact. The quasistatic cy¬ 
cle also necessitates coordinating the separation between 
readouts and receptors; although this is not inconceivable 
within a cell, it would require elaborate machinery. Per¬ 
haps most importantly, however, the quasistatic protocol 
requires manipulation of the concentration of chemical 
fuels; this manipulation must change chemical potentials 
by several kT to be effective. Given the relatively small 
number of chemical fuels available, and their extensive 
use in a range of systems, it would be very surprising if 
the cell manipulated chemical potentials purely for the 


sake of measurement efficiency. 

The final observation may also explain why cells use 
such a strong chemical driving (the hydrolysis of ATP 
typically provides approximately 20kT, deep into the low 
error regime), rather than more efficiently taking mea¬ 
surements of only slightly lower accuracy^S. Indeed, the 
most efficient strategy from the perspective of sampling 
is to make many low-accuracy copie^^, this, however, 
requires time (for the measurements to be independent) 
and readout molecules (to store the measurements), re¬ 
sources which are not free for the cell. 

Although motivated by the time integration of recep¬ 
tors by readouts, our analysis is also potentially appli¬ 
cable to other systems in which signals are propogated 
via chemical modification. Further, these ideas are also 
germane to synthetic biology and biological engineering 
in which the constraints and goals are distinct from those 
of natural systems. Additionally, we propose a new class 
of systems in which the fundamental thermodynamics 
of computation can be explored, to complement experi¬ 
ments done with optical or electrostatic feedback trap^^ 
and magnetic system^. Our approach involves manipu¬ 
lating biomolecules by adjusting the chemical potential 
of fuel molecules. Our experimental system is particu¬ 
larly promising because the dissipation could in principle 
be measured directly for a large number of devices acting 
in parallel rather than inferred from positional trajecto¬ 
ries as it is done for the optical or feedback traps. In 
an experimental realization, it would be natural to treat 
the reservoirs and memory together as an extended sys¬ 
tem thermally coupled to the outside world. In this case 
the chemical free energy dissipated during measurement 
is not equal to the heat exchanged between the extended 
system and the outside worlcP^ - the increased entropy 
of the universe is instead manifest in a less uneven distri¬ 
bution of ATP, ADP and P between reservoirs. It would 
therefore be most natural to measure dissipation through 
the changing concentrations of ATP, ADP and P as the 
reservoirs exchange molecules - perhaps through radioat- 
ive labelling of phsophates. Further, it should be possible 
to perform full measurement cycles and probe the link 
between information loss and irreversibility. By measur¬ 
ing the state of the readout using, e.g., FRET, it would 
also possible to test the generalized Jarzinksy equality, 
which shows that the state of the system can be changed 
more efficiently by exploiting knowledge of the state of 
the systenPHaa, 

III. METHODS 

We analyse the biochemical network at the level of the chemi¬ 
cal master equation. Reactants are assumed to be well-mixed and 
reactions occur with a propensity given by the concentration of 
reactants multiplied by a rate constant. Concentrations of ligand, 
ATP, ADP and P are assumed constant, and bound states of recep¬ 
tors and readouts short-lived. The master equation then reduces to 
instantaneous second order reactions between receptors and read¬ 
outs, and first order interconversion of receptor states. We calculate 
the dissipation of the biochemical network at steady state and in 
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the mean-field limit, thereby assuming that receptor and readout 
states are uncorrelated. In the steady state, the dissipation rate is 
equal to the rate of chemical work done by the fuel (the net reactive 
flux multiplied by the drop in chemical potential of fuel molecules). 
The full master equation, the mean-field limit, and a discussion of 
the validity of this approximation are provided in Supplementary 
Discussion 1. 

We compare the biochemical network to a two-state system in 
which computational operations are performed by external manip¬ 
ulation of the energy levels. An example is provided in Supplemen¬ 
tary Fig. 1, and analysed in Supplementary Discussion 3. I n this 
case, the work done during a protocol A(t) can be calculate d'^^ l ^^ l 
as f diXOl-L/dX, in which "H is the internal energy of the system. 
This integral is simple to calculate for a two-state system in which 
manipulations are quasistatic, as demonstrated in Supplementary 
Discussion 3. In the case of the quasistatically-manipulated bio¬ 
chemical system, dissipation is calculated chemical work performed 
in one cycle (see Supplementary Discussion 6). 

The dissipation is bounded by the mutual information I gen¬ 
erated between the data and memory by copying. The mutual 
information between two stochastic variables X and Y is given by 
I = T,^,yPi^ = x,Y = This quantity is 

trivial to calculate for given input data and copy accuracy. 
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Appendix A: The master equation of the biochemical network 


The model presented in Eq. 2 of the main text defines a master equation for the variables x* and RL, the number 
of phosphorylated readouts and ligand-bound receptors, respectively. 

= -([L]fci(i?T - RL) + k 2 RL)P{x*,RL) 


+ [L]ki{RT - (RL - l))P{x*,RL - 1) -h k 2 {RL + l)P{x*,RL + 1) 
- {{ks^ + k^L^^){xT - X*) + {kiLL + P{x*,RL) 


(Al) 


+ + k 5 ^^f^){xT - {x* - l))P{x* - 1,RL) + (k^^ + + 1,RL), 

in which V is the system volume. We assume that the ligand concentration is large enough that [L] is uninfluenced 
by RL - thus ki = ki[L] is a constant. We also assume that bound states formed between receptors and downstream 
proteins are short-lived compared to the typical time between binding events, so that all receptor/readout reactions 
are essentially instantaneous. This is equivalent to assuming that the probability of finding the memory in between 
the wells (at the top of the barrier) is negligible in the computational device in Supplementary Figure 1. In this 
work, we are interested in the steady state. Firstly, it is trivial to see that RL does not depend on x*. Due to the 
linearisation that arises from assuming ki[L] = ki, the stationary distribution for P{RL) is a simple binomial: 


P{RL) 


fci \ “ / fc2 \ Rt\ 

fci + ^2 / \fci + fc2/ {Rt — RL)\RL\ 


(A2) 


with mean {RL)/Rt = p = ^ 1/(^1 Tfe). Finding the full solution, P{x*,RL) = P{x*\RL)P{RL), is more challenging 
due to non-linearities that remain in the reactions involving phosphorylation and dephosphorylation of x. However, 
we do note that the behaviour of each readout molecule is independent of the other readouts, and thus 

Pix*\RL) = q{*\RL)^tl - (A3) 

in which q{*\RL) is the probability that a single readout is phosphorylated in a system with Rt receptors, given that 
RL of them are ligand-bound. Note that this holds when RL flucuates in time. Even the calculation of q{*\RL) is 
non-trivial, however, and we do not attempt it here. 



13 


1. Mean-field approximation 


To derive Eqs. 11 and 12 of the main text, we make the mean-field assumption that fluctuations in x* are 
uncorrelated with fluctuations in RL. Mathematically, we assume P{x*\RL) = P{x*\{RL)) = P{x*). We combine 
this assumption with the fact that, as x* is a Id variable with reflecting boundary conditions, the total flux associated 
the increment x* ^ x* + \ must cancel with the total flux associated with x* + 1 ^ x*. At the mean-field level of 
approximation, this condition yields 


, {RL) , , Rt- (RL) 
+ -y- 


(x* + l)P{x* + 1 ) = 


By summing over x*, we obtain the following expression for (x*): 


, {RL) , , Rt- {RL) 
h— + k, -- 


, ,, f, {RL) , , Rt-{RL) 
) = Ua—^ - - - 


(xt — x*)P{x*). 


(xT - (x*)). 


and therefore 

k^p + kz{l-p) . . . 

^ [xt] {k:i + k^)p+{k^ + k^){l-pY ^ ’ 

as quoted in Eq. 12 of the main text. Further, the average flux of readouts around the cycle is given by 

hflux = y {RLx) - y {RLx*) = kspRr [xt] - {RLx*). (A7) 

In our approximation, {RLx*) = {RL){x*\RL) and thus 

hflux = {ksi^ - f) - k4f)p[xT]RT, (AS) 

as stated in Eq. 11 of the main text. For completeness, we also give the differential equations that govern RL and x* 
in the limit of deterministic chemical kinetics: 

^ = -k^RL + fci(i?T - RL), 

(A9) 

^ = -{k4RL] + keilRr] - [RL]))x* + [kslRL] + k^lRr] - [i?L]))(xT - x*). 

The steady state of these deterministic equations is given by the stationary stochastic averages in the mean-field 
approximation: RL = {RL) and x* = {x*\RL). 

The mean-field limit is valid when the number of receptors is large, so that fluctuations about the average are 
negligible (note that the number of readouts does not have to be large). The mean-field limit is also valid when the 
kinetics of receptors is fast compared to that of the readouts - in this case, P{x*\RL) = P{x*\{RL)) = P{x*) since 
the receptors fluctuate too rapidly to be tracked by the readouts. Indeed, the master equation can be readily solved 
for a single receptor and a single readout, with the steady-state solution 

py* _ Q pp = 0) = ^4 + ^2^6+^.3^6+^4^6) 

py* _ 2^ pp = 0) = ^2(fclfc3-flfc2fc5+fc3fcs+fcftfcs) 


py* _ Q pp = = fci [L](kik'^+k2k'g+k'^k'f^+k'^k',^) 

py* _ ^ pp — 2) = (^1^3+^2^5+^.3^5+^3^6) 


in which k{ = kijV, and C is a normalizing constant. In the limit ki, k 2 ^ 3 , ^ 4 , /cg, fcg, the net flux can be directly 
evaluated as 


k'^P{x* =0,RL = 1)- kiP{x* = l,RL=l) = 


consistent with Eqs. 11 and 12 of the main text. 


(*3^6 - k'ik'4p{l-p) 


(fcafcg - kikY)p{^ -p)[xt\Rt 


p{k'^ + k'Y) -b (1 - p){k'^ + k'o) p{k3 + ki) -b (1 - p){k5 + ke) ’ 

(All) 
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Finally, the mean field limit also holds if the receptors do not fluctuate at all - i.e., if RL is fixed over time at a 
given value. Such a system (which would be a poor concentration sensor) would be equivalent to a system with a fixed 
number of constitutively active kinases and a fixed number of constitutively active phosphatases. In this case, the 
fixed value of RL/Rt determines p, rather than the average of the ligand binding and unbinding dyamics. However, 
the equations for x* and hflux hold exactly given this value of p. Note that this situation is not equivalent to slow but 
finite receptor dynamics, in which RL still fluctuates for a given system. 


2. Mapping to the copy process does not require the mean-field approximation 

The mapping between an abstract system that performs copies and the biochemical network in Eqs. 3-5 of the 
main text was performed at the level of transition rates in the Markov model. It is therefore not dependent on a 
mean-field approximation. We also note that the expression for the average rate at which copies are made, 

«copy = {kcopyXT + k^opy^r) = ( + ^4 ^5 + ^6 ^ fee) (1 - p))i?T [s^t] , (A12) 

does not rely on a mean-field approximation. 


3. Violation of mean field behavior 

For low numbers of receptors and slow receptor dynamics, the approximation P{x*\RL) = P(a;*|(i?L)) = P{x*) 
breaks down. In this case, (RLx*) > {RL) {x* \ RL), since a greater number of ligand-bound receptors leads to more 
phosphorylation. The result is that hflux is lower than under the mean-field assumption. Since the expression for the 
rate of copying remains valid. 


Wchem ^ (fcflfce - kik^)p{l - p) 

»4copy {{k^ + k^^)p + {k^ P kQ){l - p))^ \kAk^) 

Physically, the cost of a copy cycle is reduced because the state of the readout is correlated with the receptor prior 
to copying. This means that fewer actual phosphorylation/dephosphorylation events occur per copy cycle, and hence 
less dissipation is necessary. Our comparison to copying, and the calculation of the information generated, assume 
the measurement of uncorrelated data bits; a lower cost for copying of correlated data bits can also be achieved with 
the ideal device in Supplementary Fig. I. To do this, the decorrelation step 6 in Supplementary Fig. 1 must not be 
allowed to run to completion - the barrier between wells should be raised after a certain finite time. The memory 
bit should then be exposed directly to the next data bit (step 3 in the quasistatic protocol), and the barrier between 
wells only lowered when the strength of interaction between memory and data bits reflects the correlation between 
successive data bits. For example, if the correlation between subsequent data bits tends towards unity, the optimal 
protocol would retain the memory bit in the state of the previous data bit by reducing the time spent in step 6 
towards zero. The memory bit would then be exposed to the data bit, but the barrier between wells is not lowered 
until the coupling is extremely strong. In the limit of perfect correlation, this procedure would require no work. A 
full exploration of the correlated regime is beyond the scope of this paper, due to its additional complexity. We note 
in passing that correlated copies of the receptors are not helpful in sensinj^, and indeed the readout molecules need 
to respond slowly in order to perform time integratiorf^. The regime in which the readout transitions are slower 
than those of the receptor state not only makes the mean-field analysis accurate, but is also the biologically relevant 
regime. 


4. Learning rate in the mean-field limit 

The “learning rate”l^ gives the rate at which transitions in one part of a bi-partite reaction network act to increase 
the mutual information between that part of the network and the other part. The most obvious application to a 
sensory system such as discussed in our manuscript would be to calculate the learning rate that results from the 
biochemical network’s response to changes in the concentration \L\. This would be highly relevant to the quality of 
the network as a sensor of dynamic concentrations. However, it is not of direct relevance here, as we are concerned 
with whether the action of the readouts can truly be related to computational copying, and if so, how the efficiency 
relates to thermodynamic bounds on copying. This learning rate would depend both on the steady-state distribution 
and dynamics of [L\. The receptors and readouts also form a bi-partite system, and hence a learning rate can be 
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defined here. The parameters of the model specify a learning rate, but to calculate it would require the evaluation of 
P{x*, RL). In the mean-field limit we can show that the learning rate is zero. The learning rate i^^ 


i = - Y. u^^^in 

X* ,RL x*'^x* 


f P{x*,RL) \ 
\Pix*',RL)) ’ 


(A14) 


which, using our mean-held approximation, can be re-written as 


X* x*'^x* 


( Pjx*) 

\P{x*') 


(A15) 


in which = J 2 rl PiPP)'^x^x-‘'■ The variable Wx*,x*' is only non-zero for adjacent values of x* and x*'. 

Further, in the steady state, the average how from x* to x*' must cancel. Therefore we obtain a balance condition 
that holds for all a;*, a:*': 


P{x*, RL)w 


RL 


-P{x*',RL)w!^P,=0. 


RL 


Applying the mean-held approximation to the above expression gives 

P{x*)Wx^,x^> =P{x*')Wx*,^x^. 

Substituting Eq. A17 into Eq. |A15| gives 


I = Y Pix*Wx^,-^' In (Pi^*)) - P{x*')Wx>',x> iPix*')) = 0. 

X* .X*' ,X*'^X* 


(A16) 


(A17) 


(A18) 


The fact that the learning rate between receptors and readouts is zero in the mean held limit is consistent with 
the argument in the previous subsection. In the mean held limit, receptors are perfectly time-averaging, instead of 
responding to huctuations; the learning rate reports the degree to which x* responds to huctuations in RL. 


Appendix B: Receptors that only act as kinases in the canonical biochemical network 


In the main text we considered a canonical biochemical system in which receptors were bifunctional - RL acted 
as kinases, and R as phosphatases. An alternative is a system in which RL act as kinases but R are inactive, and 
const it utively active phosphatases exist within the cell. At the level of second-order rate constants, our reactions are: 


RL -I- a: ^ RL + x*, 

k4 

P + X* ^ P + X. 


(Bl) 


In this system, it is not the state of the receptor that is copied into the readout, as R is completely passive. Instead, 
enzyme identies RL or P are copied into the state of the readout as x* or x respectively, in the same way that RL 
and R were copied in the bifunctional system. The relative frequency with which each copy happens can be used to 
estimate ligand concentration. Eqs. 11 and 12 of the main text still hold, but the average fraction / of phosphorylated 
readout in the mean-field limit is 


r _ _ hpRr + hPr _ 

(fca -I- k4)pRT + {h + ke)PT ’ 

in which Pr is the total number of phosphatases. Thus 

_ {kske - k4k5)p[xT]PTRT 

Wchem \k 4 k 5 ) 

Analogously to the system considered in the main text, the rate at which copies are attempted is 


(B2) 


(B3) 


ncopy2 = [xt] ((fcs + k4)pRT + {k^ + ■ 


(B4) 
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Combining Equations B3 and B4 


the dissipation of free energy per copy cycle in the mean-field limit is given by 


Wchem _ (fcafee - ^4 fcs )pPtRt / k^\ 

?^copy {{ks + k4)pRT + {k5 + kQ)PT)'^ \k4k5 J 


(B5) 


Further, 


/ _ (^3 + k4)RT 

^ ^ {k3 + k4)pRT + (fca -I- ke)PT 

is the probability that an attempted copy is of RL rather than P, and 

fca fee 

srl — —rq—, sp — -— 

fca + K 4 fca + ke 


(B6) 


(B7) 


are the accuracies with which RL and P are copied, respectively. 

The results are closely analogous to those obtained in the main text for the canonical biochemical network, and in 
fact are related by the transformation (1 — p)Rt —t Pt (which is perhaps unsurprising). Indeed, Wchem/^icopy is the 
same function of p' in both cases 


^11^ = (si + - l)p'(l - p'm,, + E,,). (B8) 

^copy 

Thus our analysis in the main text still holds, except for the fact that the phosphatase concentration is not directly 
controlled by varying the probability of receptor-ligand binding p. A given cell operating without bi-functional 
receptors, therefore, does not automatically adapt to high p by converting all phosphatases to kinases, and so will not 
use a vanishing free-energy per copy cycle at high p (because p' does not tend towards 1, unlike in the bi-functional 
system). Its behaviour as a function of p\ however, is identical. 


Appendix C: A typical computational device and protocol 

Here we outline a non-autonomous computational protocol for copying/measurement that reaches the ther¬ 
modynamic bound of efficiency for a process that performs copies with a given accuracy s = sp = spp = 
1/(1 -I- exp{—Es/kT)), and subsequently allows the information to be lost in an uncontrolled fashion. We shall 
initially consider a process that reaches the bound when copying data which is in state 1 with a probability p' = 0.5, 
before subsequently generalising. 

The prototypical memory device is a system capable of storing binary information reliably for a long time: a 
memory hit. The state of the memory bit can be altered by coupling it to external systems; it is important that the 
memory bit persists in the altered state after the coupling is removed. We shall also manipulate the memory bit using 
a system of known state, a reference bit, that allows the memory bit to be reset to a standard state. 

The computational device and protocol discussed here is based on that of Be nnettP . Our memory bit is a single 
particle in a double-well energy landscape, as illustrated in Supplementary Fig.ll^l^lill. A particle in the left well is 
in state 0, a particle in the right well is in state 1. Coupling the memory bit to an external bit in state 0 raises the 
right-hand well (stronger coupling implies a larger shift), whereas coupling the memory to an external bit in state 1 
raises the left-hand well. The barrier between the wells of the memory bit can be lowered and raised as desired. 


1. Special case for p' — 0.5 

The protocol is illustrated in Supplementary Fig. 1. The memory bit starts in state 0 (the left well) with a 
probability 1/(1 exp{— Er / kT)) , due to having been equilibrated by a reference bit in state 0 at a coupling strength 
Er (the accuracy of this initial reset will turn out to be irrelevant for the cost). The data bit is then copied into the 
memory bit (steps 1-5). Steps 6-8 return the memory to the standard state 0 prior to the next measurement. In step 
6 , the memory and data bits decorrelate, before the actual reset in step 7 and the raising of the barrier in step 8. 

To calculate the work done in the protocol, we make the following assumptions. We will assume that the barrier is 
large enough that hopping between the wells is negligible on all relevant timescales unless the barrier is deliberately 
lowered. We will also assume that the wells are broad and steep-sided so that the potential energy of a particle in 
either well is well-defined. Finally, we assume that raising/lowering the barrier does no work because the probability 
of finding the particle at the top of the barrier is always negligible (we are considering an ideal two-state system). 
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Couple to data 
bit in state 0 





Couple to data 
bit in state 1 



FIG. 6. Supplementary Figure 1: A canonical computational measurement/copy cycle with a particle in a double well 
potential representing a memory bit. The particle starts in state 0 (the left well) with the energy of the right well raised by 
Eo S> kT due to coupling of the memory to a reference bit. Step 1: the barrier between wells is lowered. Step 2: The memory 
is slowly decoupled from the reference bit, allowing the right well to fall. The particle remains in equilibrium between the two 
wells. Step 3: the memory is gradually coupled to a data bit of unknown state, raising either the left or the right well. Step 
4: the barrier between the wells is raised. Step 5: the memory and data bits are decoupled, completing the copy stage of the 
protocol. Steps 6-8 are performed prior to the next copy to return the memory bit to state 0. Step 6: the barrier between wells 
is lowered, allowing the particle to equilibrate between the wells. Step 7: the memory is gradually coupled to the reference bit, 
and the particle is returned to the left well. Step 8: the barrier is raised, returning the memory bit to its initial state 0. Steps 
1-5 constitute the copy protocol, step 6 is decorrelation, and steps 7 and 8 are the reseting of the memory. 
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For conceptual convenience, we begin our description in state 0, coupled to the reference bit that has been used to 
reset the memory bit after the last measurement. This choice aligns the start of our cycle with the start of the copy 
sub-process (the first five stages of the cycle). A reasonable alternative would be to start our description at step 6, 
so that our measurement procedure begins with setting the memory bit to 0 (steps 6-8) and subsequently involves a 
copy (steps 1-5). This arbitrary choice is of no fundamental importance. Proceeding step-by-step: 

1. The barrier is lowered; no work is done. 

2. The right well is slowly (quasistatically) lowered from Er to 0 as the coupling of the memory bit to a reference 
bit of known state 0 is reduced. Here, slowness is measured relative to the relaxation time of the system; 
it is assumed that at any instant, the memory is in a state that is rep resen tative of equilibrium given the 
instantaneous energy landscape. The average work done in this protocol i^23S3 

rO -E'/kT / 2 \ 

Je^ l + = ’ (Cl) 

which tends to — fcnn2 in the limit E^/kT —>■ oo. This result shows why the presence of the reference bit at 
the start of the copy is necessary. Had we instead performed steps 1 and 2 in reverse, lowering the right well 
to 0 and then lowering the barrier (equivalent to initiating the measurement without a reference bit present), 
we would have allowed the memory bit to relax freely to the 50/50 state. The work extracted would have been 
zero, increasing the total work done during a cycle (by A:Tln2 as E^/kT —> oo). 

3. The memory is exposed to the data bit; as the coupling of the memory bit to the data bit increases, either the 
left or right well rises from 0 to Eg depending on the state of the data bit. The average work done by slowly 
coupling the memory to the data bit is 

p-E'/kT f 2 \ 

which tends to A:Tln2 in the limit Eg/kT —> oo. 

4. The barrier is raised; no work is done 


5. The system is decoupled from the data bit, lowering the well that was previously raised. If Eg is finite, there is 
a finite probability of the particle being in this “wrong” well; the work done on the system is 


W5 = - 


p-E,/kT 
_ E 


(C3) 


which tends to 0 in the limit Eg/kT —>■ oo. This subprocess completes the copying stage of the measurement 
protocol; the remaining stages involve setting the memory bit to 0 ahead of the next measurement. The 
probability of performing a correct copy is 1/(1 -I- exp(— Eg/kT)) = s, as required. 

6 . The barrier is lowered. The memory bit becomes decorrelated from the data bit. The total entropy of memory 
and data bits therefore increases, but no work is done. In the limit Eq ^ oo, the memory and data bits are 
perfectly correlated prior to this step. Therefore the possible configurations of the memory and data bits change 
from (0,0) or (1,1) to (0,0), (0,1), (1,0) or (1,1), an entropy increase of A:ln2. 

7. The memory is reset by slowly coupling to a reference bit of state 0. The average work done is 

j-Er E'/kT / 2 \ 

= y, l + g-S'Ar"^^' = In y^^^_EjkT J > (<^4) 

which tends to A:Tln2 in the limit E^/kT — >■ oo. 

8 . The barrier is raised; no work is done. The memory bit has been reset to state 0 with probability 1/(1 -I- 
exp{-Er/kT)). 


The total work done during this cycle is 


p-E,/kT 

Eg. 


wt = kT In 


1 -j- Q-EJkT 


1 -(- Q-EJkT 


(C5) 
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As the state of the system is unchanged at the end of the cycle, this work is dissipated irreversibly. If p' = 1/2, and 
the accuracy of copying is sr = srl = s, the information generated by the copy process is 


I = s ln(2s) + (1 — s) ln(2(l — s)) = ln(2s) + (1 — s) In 


1 — s 


= wt / kT. 


(C6) 


The final equality follows from the definition of Eg in terms of s, Eg = fcrin(s/(l — s)) or equivalently s = 1/(1 + 
eicp{—Eg/kT)). Thus the protocol reaches the thermodynamic bound on dissipation for copying data with p' = lj2 
at an accuracy of s. 

In the main text, we contrasted the cost of the biochemical network and the optimal protocol for p' = 0.5, Sr = 
srl = s by considering the (free-) energy drops associated with the transitions of the memory. In fact, this is the heat 
associated with the transitions (or the total drop in chemical free energy of the entire system). One can derive the 
same cost per copy (Eq. C5) for an optimal protocol by considering this heat transfer instead of work. For p' = 0.5, 
Sr = srl = s, the net heat transfer is given entirely by the heat transferred in step 4; steps 2 and 7 cancel, and the 
heat is zero for other steps. This is why the intuitive discussion in the main text, which only considers the actual 
transitions of the memory by which it is set to match the data, is valid. 


2. Generalising to p' / 0.5 


It is clear from the fact that Eq. |C5| does not depend on the distribution of input data that the above protocol is not 
optimal for all p'. In this protocol, we made the implicit decision to perform decorrelation (step 6) in a state in which 
there is no bias between the two wells. We could, instead, consider performing decorrelation by first coupling the 
memory to another reference bit in state 1 with coupling strength ifoff, then lowering the barrier to allow decorrelation, 
and finally quasistatically reducing the offset to 0 prior to proceeding with the reset. 

There is no change to stages 1-5 or 7-8, in which the total work performed is 

It is now necessary to raise left well (state 0) by an energy Eog, perform the decorrelation, lower the left well back to 
0 and finally raise the right well to Eg. The work done in raising the left well is 

We. = (1 - f)EoS. (C8) 


since a particle will be in the left well after the measurement with a probability 1 —f by definition (/ = p's-|-(I—p')(I—s) 
is the probability of a measurement outcome being 1). No work is done during the decorrelation stage, and in lowering 
the left well quasistatically to 0 the work done is 


Wee = 


dE-_ 


exp(—if/fcT) 


= -fcTln 


/fioff 1 + exp(-E;/fcr) 

Summing over all contributions gives a total work during the cycle of 


I -I- exp(-ifoff/fcr) / ■ 


Wt = kT In 


1 -k 


^-Es/kT 


I -|_ g-Es/kT J 2^ _|_ Q-Es/kT 
At fixed p' and s (and hence /), this expression is minimised by 


Eg + {1- f)Eog. 


Hence the optimal decorrelation energy is 


for which the work is 


1 - f-= 0 

•' 1 _(_ (,-E^tt/kT 


El^ = kTHf/{l-f)), 


= kT {sins + {1 - s) ln(I - s)) - kT (/In/ -k (I - /) ln(I - /)). 


(C9) 

(CIO) 

(CIl) 

(CI2) 

(CIS) 
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In other words, decorrelation should be performed at a bias that reflects the measurement outcome. We can show 
that this optimal saturates the thermodynamic bound; in this case, the mutual information generated by the 
copy is 


I =p'sln ( +p'{l - s)ln + (1 -pOsln “ s) In 


(C14) 


Using / = p's + (1 - p')(l - s). 


I = (slns + (1 - s)ln(l - s)) - (/ln/ + (1 - /)ln(l - /)) . 


(C15) 


Comparing Eq. C15 with Eq. C13 shows that rulp = kTI, and therefore that the protocol with appropriately chosen 


reaches the thermodynamic bound for copy cycles if the resultant correlations are not used to extract work. 

We note that raising the left well by a negative value of Eoff to perform decorrelation, in this picture, can only 
mean raising the right well by —Eos by coupling the system to a reference bit in state 0. An analogous derivation to 
that presented above holds for this operation, so the result holds for both positive and negative Eoff. In this analysis, 
we haven’t considered unequal sr and srl] it is not obvious how to adjust this protocol to achieve unequal accuracy. 


3. Cause of dissipation in the optimal protocol 

As mentioned in the main text, it is actually the decorrelation step that is thermodynamically irreversible, rather 
than resetting. This is particular clear since the reset energy E^ does not appear in the thermodynamic cost. The 
cause of irreversibility is the failure to extract work from the combined memory and data bit system as it relaxes 
from a state more likely to be (memory, data) = (0,0) or (1,1) than (0,1) or (1,0) to one equally likely to be in (0,0), 
(0,1), (1,0) or (1,1). This is directly analogous to the irreversibility of the free expansion of an ideal gas. A procedure 
that did extract the maximum work from this decorrelation would be fully reversible, and the overall entropy change 
of the universe during the cycle would be zero, corresponding to the best possible performance of Maxwell’s demon^. 
Reversing the copying procedure using the same data bit, or “uncopying”Ii^, uses the initial correlation between 
bits to provide the necessary work to reset the memory bit and is therefore an example of extracting work during 
decorrelation. 

Not only is resetting reversible, it is unnecessary in an optimal cycle. Steps 1 and 2 are the reverse of steps 7 and 
8 : the initial part of the measurement protocol reverses the reset to the standard state of 0. Thus these steps are 
unneccesary, and a cycle of steps 3-6 constitutes an equally valid measurement or copy protocol. As the work required 
for 7 and 8 exactly cancels the work returned by steps 1 and 2, the total work done in this abridged protocol is still 
given by Eq. |C5[ 


Appendix D: Sampling with unequal accuracies 

Supplementary Figure 2 shows that r] < ^ tor all p' if sr = srl = s. In Supplementary Figure 3 (A-C), we plot 
efficiency 77 as a function of independently varying sr and srl for p' = 0.5, 0.8, 0.98. As p' becomes more extreme, a 
region of 77 > ^ appears and grows, with the maximal value of 77 found at sr « 0.5 and srr « p'. Indeed, if we plot 77 
against p' with sr — 0.5 and srl = p', as in Supplementary Figure 2 (D), we see 77 approaching unity as p', srl —t 1. 
For 77 to get close to unity, however, requires extreme values of the parameters. We also note that 77 —1 does not 
occur when the dissipation in a phosphorylation-dephosphorylation cycle is low, and is independent of the absolute 
reaction rates. 

In the main text, we gave an intuitive explanation of why an efficiency of 0.5 is limiting in the case sr = srl, 
p' = 0.5. This was based on considering the heat deposited into the environment (or the total drop in chemical free 
energy) by the actual transition of the memory bit. A similar analysis applies here, but when p' « srl —> 1 and 
Sr « 0.5, the largest contribution to the overall cost comes from copy cycles involving RL rather than R. For the 
biochemical network, measuring RL involves srl — f transitions per measurement with a free-energy jump of E^j ^^. 
For the optimal protocol, the key process is now raising the left well quasistatically from E^g to Egj ^^, resulting in a 
transfer of srl — f particles at an average energy jump of E^g < E < Egj^^. When p' « srl —t 1 and sr « 0.5, and 
unlike the sr = srl, p' = 0.5 case, E^ff and Eg^^^ are both large and similar in magnitude. This fact means that the 
deposited heat is close to Eg^^ and hence the efficiency is close to unity. 
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FIG. 7. Supplementary Figure 2: Efficiency r; as a function of p' and s = sr = srl- Note that p does not exceed 0.5, and 
that moving p' away from 0.5 always increases p at fixed s. A mirror image is obtained for p' < 0.5. 


Appendix E; A clocked biochemical device and protocol 


We illustrate a discrete biochemical copy process in Supplementary Figure 4. The readout starts in a state x or 
X* determined by the previous measurement. It is then brought into proximity with a receptor (uncorrelated with 
the previous one) so that catalysis can occur. The readout is allowed to reach a steady state, then removed from the 
proximity of the receptor. 

In this protocol the data and memory are allowed to interact for some period of time and the final result is taken 
to be the output of a single copy. The receptor state should remain constant in this period (in practice, constitutively 
active kinases and phosphatases could be used). By contrast, in the cellular network it is most natural to consider 
a receptor that switches rapidly on timescale of readout modification. Nonetheless, as we show next, provided that 
receptors are uncorrelated from one measurement to the next and are presented to readouts with the appropriate 
relative frequency, the cellular and clocked biochemical protocols have the same measurement outcomes and efficiency. 

If a series of measurements are performed on uncorrelated receptors using the clocked protocol, the same phospho¬ 
rylation fraction / as in the cellular network (Eq. 12 of the main text) will be obtained, provided that receptors in 
the ligand-bound state are present with a probability p'. The accuracy of copies is still given by Eq. 7 of the main 
text, because sr and srl quantify the steady state probabilities of finding x* when the receptor is in state RL and x 
when the receptor is in state R, respectively. The work per copy cycle is also given by Eq. 16 of the main text; to see 
this, note that the free energy of the system changes when the molecule x is converted into x* or vice versa, either by 
RL or R; reactions that overwrite x with x or x* with x* do not cost free energy and hence do not contribute to the 
chemical work. The readout x is converted into x* by RL with probability Px^x*-,rl = p'spLi^ — f) and an associated 
chemical work of Similarly, Px^x‘-r = {l-p'){l- sr){ 1- f), Px-^x-rl = p'(I - s^l)/, and Px*^x-r = p'srJ, 

with associated chemical work —Es^^, —Esrl Es^ respectively. Using / = p 'srl + (I —p')(I — sr) and summing 
gives a dissipation per copy cycle equal to Equation 16 of the main text. 

Our clocked biochemical protocol connects to typical computation models that involve manipulating energy land¬ 
scapes, as illustrated in Supplementary Eigure 4 (B). In effect, exposing a readout to a receptor in the RL state 
immediately biases the landscape towards the x* “well” and simultaneously lowers the barrier for the transition. 
Similarly, exposing the readout to R can be interpreted as suddenly raising the x* “well” and lowering the barrier. 
The fundamental difference between Supplementary Figure 4 (B) and a optimal computational protocol, such as that 
in Supplementary Figure I, is the manner in which the landscape switches suddenly, rather than quasistatically. 
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FIG. 8. Supplementary Figure 3: Efficiency of the biochemical network rj as a, function of s_Rand srl for (A) p' = 0.5, 
(B) p' = 0.8 and (C) p' — 0.98. Note the substantial region in which rj exceeds 0.5 that appears as p' grows, centred around 
Srl ~ p and sr « 0.5. Also note that as srl — > 1 at fixed p', p is small. (D) Plot of p against p' = srl at sr = 0.5. Note 
that convergence on unity only occurs at extreme values of srl = p' ■ 


Appendix F: A thermodynamically optimal copy operation using biomolecules 
1. Detailed calculation for an optimal biochemical bit 

The readout begins coupled to a buffer with with AGp, AGd = —AGr {AGr assumed to be large and positive, 
although in fact this initial condition does not influence the copy accuracy or work). There is no receptor in close 
proximity, but the readout has equilibrated at the end of a previous measurement cycle by a receptor in the R state. 
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FIG. 9. Supplementary Figure 4: An artificial biochemical system in which measurements are performed in a clocked 
fashion, analogous to computational models. (A) Schematic of the system - the readout, attached to a polymer by a tether, 
is moved past a series of receptors, (i) The memory is between receptors, and cannot interact with them, (ii) The memory is 
brought into proximity with a receptor, allowing catalysis and copying, (iii) The memory is moved on, and cannot interact with 
any receptors. It retains the state of the previous receptor until the next measurement. (B) An interpretation of the process 
in (A) in a two-state energy landscape picture. In this analogy, the left well corresponds to state x and the right well to x*. 
The presence of an enzyme simultaneously lowers the barrier between x and x* and shifts the equilibrium. 



Reference bit 


FIG. 10. Supplementary Figure 5: A biochemical implementation of an optimal copy device and protocol, in which only 
the RL state of the receptor is catalytically active. The cycle is illustrated in (A). As in the main text, the system consists of 
receptors acting as data and reference bits attached to the ends of a reaction volume, and a tethered readout (memory bit). 
The reference bit is a receptor in state RL, the data bit is a receptor in either R or RL (illustrated here as as RL). Initially, 
the readout is not in close proximity to any receptor, and is in a solution in which AGp = AGs is large and positive (the 
reaction in Eq. |G1| is driven to the left), indicated by the light color of the reaction volume. Its is unphosphorylated due to 
prior coupling to the reference bit. Step 1: the readout is brought into close proximity with the receptor acting as a data 
bit. Step 2: the solution is quasistatically changed from AGp = —AGs to AGp = AGs (changed conditions are shown by the 
darker color). If the receptor is in state RL, the readout tends to be phosphorylated. Step 3: the readout is separated from 
the receptor acting as a data bit. Step 4: the solution is quasistatically changed to AGp = —AGoff, the free energy of reaction 
at which decorrelation will take place. Step 5: the readout is brought into contact with the receptor acting as a reference bit 
to allow decorrelation with the data bit. Step 6: the readout is reset by quaistatically changing conditions to AGp = AG 3 . 
Step 7: the readout is separated from the reference bit, restoring the initial state. (B) A device for implementing this cycle. 
A small reaction volume is coupled to a series of reservoirs of varying ATP, ADP and P content. The current reservoir can be 
changed by pushing a piston. Similarly, the receptors and readout can be brought in an out of proximity by manipulation of a 
second piston. 
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and therefore is in state x with probability 1/(1 + exp(—AGr/fcT)). 


1. The readout is brought into close proximity with a receptor of known state R] no reactions take place on average. 

2. AGp, AGd are slowly (quasistatically) raised from — AG^ to 0. The chemical work is given by the average 
number of reactions of a given type, multiplied by the associated chemical work, integrated over the whole 
process. In the presence of a receptor in state i?, only R + x*^R + a; + P is possible, and in an infinitesimal 
step from AGd to AGd + dAGd, the average number of net dephosphorylation events is 


exp {-AGd/kT) _ exp (-(AGd + dAGrf)/fcr) _ exp {-AGd/kT) dAGd 
1 + exp (—AGd/fcT) 1 + exp(—(AGd + dAGd)//cT) (1 + exp (—AGd/AT))^ kT 


(FI) 


Thus 

dAGd (AGd + AGxjx *) exp (—AGd/kT) 

^2.chem — / TTtt ^ ^ I Z!! 779 

J-AGx kT {1 + exp {-AGd/kT)) 


(F2) 


since —{AGd + AG^/x*) is the chemical work done by the buffer in a single dephosphorylation reaction catalysed 
by R. Computing the integral, 


W2.chem = -kTln 


1 + exp(—AGr/fcT) 


AGrexp{—AGr/kT) /I 1 

1 + exp(—AGr/ftT) \2 1 + exp(—AG^/feT) 


(F3) 


3. The readout is brought into close proximity with a receptor of unknown state (ligand-bound with probability 
p'). AGp, AGd are then slowly lowered to — AGg. In this step, the state of the readout is set to match that of 
the receptor (with some error). Proceeding analogously to Step 2, and considering the possibility of the receptor 
being either R or RL with probabilities 1 — p' and p', respectively, 




+kT In I 


l-|-exp(-AG,/feT) 


AGs exp(-AGs//cT) 
l+exp(-AG,/fc7’) 


+ (1 ‘2'P')AGxIx* (2 l+exp(-AG,/fcT)) ' 


(F4) 


4. The readout is removed from close proximity with the receptor. No reactions occur on average. 

5. AGp,AGd are set to AGo®. No reactions occur on average. This stage constitutes the end of the copy; if 
the unknown receptor is in state RL, then the readout is in x* with probability s = 1/(1 -I- exp(—AGg/fcT)). 
Similarly, if the unknown receptor is in state R, the readout is in x with probability s = 1 /{1 + exp{—AGs/kT)). 
Thus the protocol performs a copy of accuracy s. 

6 . We now decorrelate the memory and data bits. The readout is brought into close proximity with a known 
receptor of state R. The readout relaxes to a state reflective of AGos via the reaction R + x*^R -f a; -I- P. The 
chemical work done is 

"We.chem = — (AGoff + AGx/x*) X P' ( l+exp(-AG,/fer) ~ l+exp(-AGoff/feT) ) 

(F5) 

+ (AGoff + AGx/x*) X (1 — p ) ( l+exp(-AGs/fcT) + l+exp(-AGoff/feT) ~ ' 

7. The readout molecule is reset by quasistatically lowering AGp, AGd to —AGr from AGoff, returning it to a 
state dominated by x. 


_ , / l+exp(AGoff/feT) / AG,.exp(-AGo/feT) AGott exp(AGoff/feT) 

(T7,chem — -l-fi-r m l+exp(-AG^/feT) l+exp(AGott/feT) 


-l-AGo. 


..(1- 


l+exp( —AGoff/AiT) l+exp(—AGr/feT) 


(F 6 ) 


The readout molecule is separated from the known receptor. No reactions occur on average. The system has 
now been returned to the initial state. 
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Summing Eqs. F3 F4 F5 and F6 gives a total chemical work for the cycle of 


Wchem = kT In 


1 + exp(AG'off/fcr) \ 
1 + exp(-AGs/fcr) / 


AGsexp{-AGs/kT) p' + {1 - p') exp{-AGs/kT) 

1 + exp{—AGs/kT) 1 + exp(—AGg/feT) 


(F7) 


Using the fact that the probability of the copy resulting in x* (at the end of step 5) is / = p's + (1 —p'){l — s), the fact 
that the accuracy s is given by s = 1/(1 + exp(—AG^/feT)), and the identity exp(z) = (1 + exp(z))/(l + exp(—z)), 


Wchem = kT In 


/1 + exp(-AGoff/fcr) \ 
\ 1 + exp(—AGs/fcT) ) 


(1 - s)AG, + (1 - /)AGoff. 


(F8) 


Similarly to Supplementary Discussion 3, we can minimize the work with respect to AGoff at fixed p', s and /, 
obtaining AG^fj = kTh\{f/{I — /)). Substituting for AG^g and AG^ using / and s, we see that this particular 
protocol involves work 


■u^chem = fcT (s In s + (1 - s) ln(l - s) - / In / - (1 - /) ln(l - /)), 


(F9) 


which is identical to the information gained by the measurement of the data (Supplementary Fq. C15). Thus the 
biochemical protocol with decorrelation free energy AG^g = fcTln(//(1 — /)) is a thermodynamically optimal protocol. 


2. Subtleties in the comparison between biochemical systems and energy landscapes 
a. The role of receptors 


In the energy-landscape description presented in Supplementary Figure 1, it was assumed that we are separately able 
to lower the barrier between wells and raise one well in energy with respect to the other. In the quasistatic biochemical 
protocol presented in the main text, the receptor acts as a catalyst (lowering the barrier between activation states 
of the readout), but the state of the receptor also determines which state is favourable. Thus it is not possible to 
formally separate biasing and lowering/raising of the barrier in the same way as in the energy landscape picture (the 
strength of the bias when a receptor is present, but not its direction, is determined by the buffer to which the memory is 
connected). Indeed, it is not really meaningful to draw energy landscapes in a two-state description of the biochemical 
system in which the fuel is coarse-grained away, unless there is a single receptor present. As we demonstrate, however, 
this does not prevent the biochemical system performing efficient measurement, and the purpose of each of the 8 steps 
in the biochemical protocol is closely analogous to its counterpart in Supplementary Figure 1. 


b. Chemical work 

We claim that there is a close analogy between the optimal biochemical protocol in the main text and the energy- 
landscapes description in Supplementary Figure 1. Further, we claim that the purpose of the individual steps are the 
same. However, the work calculated for the individual steps are quite different; for example, the “chemical work” 
during the decorrelation step (step 6) in the biochemical protocol is non-zero, whereas the work done is zero in the 
energy landscapes picture. 

The work for the protocol in Supplementary Figure 1 is calculated in Supplementary Discussion 3 as is typical 
in stochastic thermodynamic^^. Work is done when the energy landscape is manipulated, and heat is exchanged 
when the particle moves within the landscape. Although this approach is well-established for such systems, it is less 
clear how to proceed when the external manipulation involves the concentrations of reactants, as in our biochemical 
approach. Furthermore, as discussed above, the use of (free-) energy landscapes to describe the biochemical system 
at a coarse-grained level is not straight-forward. 

Instead, we choose to calculate the change in chemical free energy associated with the buffers (the negative of the 
“chemical work” they perform). This quantity is well defined, and in a measurement cycle (in which the memory 
returns to its initial state) corresponds to the overall change in free energy of the entire system. A decrease in free 
energy of the system corresponds to a reduction of its ability to do work, and is therefore “dissipation” unless this 
drop is harnessed to do work. 
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Appendix G: An optimal protocol involving a receptor that functions only as a kinase 


It is possible to construct a measuring device using only the reaction 

RL + X + ATP^i?L + a;* + ADP 


(Gl) 


if the R state of the receptor does not catalyse a reaction. The procedure is illustrated in Supplementary Fig. 5, 
and outlined in detail below. As in Supplementary Discussion 6, we calculate the chemical work performed by the 
ATP/ADP reservoirs during a cycle, in which the unknown receptor is in state RL with probability p\ We start with 
the readout isolated from any receptors, and in a buffer with a high ADP concentration and a low ATP concentration. 
In this case, the reaction in Eq. Gl is driven to the left; AGp = AGs where AGs is large and positive. The readout 
has previously been equilibrated by a receptor in the RL state at this value of AGp] it is therefore predominantly 
in the x state. Note that since the measurement of R will be passive, the initial state determines the accuracy of 
measurement and hence AGs appears here, rather than an arbitrary reset accuracy AGr- 

1. The readout is brought into close proximity with the unknown receptor. No reactions occur on average 

2. AGp is slowly (quasistatically) lowered from AGs to — AGs- H the unknown receptor is in state RL, the readout 
tends to be converted to x* - no reactions take place if the receptor is in state R. The average chemical work 
done is 


_ f- — AGs dAGp (AGp — AGa./a.* ) exp (— /\Gp/kT') 

^2,chem = P JaG, WT 


(1+exp {-AGp/kT)y 


— p AGxjx* ^ ] 


(G2) 


l+exp(-AGs/fcT) 1+exp (AGs/feT)^ 

The relation exp(x) = (1 + exp(ai))/(l + exp(—a;)) is useful in deriving this result. 

3. The readout is separated from the receptor; no reactions take place on average. 

4. The buffers are slowly changed so that AGp = —AGoff. No reactions take place. This stage constitutes 
the end of the copy; if the unknown receptor is in state RL, then the readout is in x* with probability s = 
1/(1 + exp(—AGs/A:T)). Similarly, if the unknown receptor is in state R, the readout is in x with probability 
s = 1/(1 + exp(—AGs/fcT)). Thus the copy has been performed with accuracy s = 1/(1 + exp(—AGs/fcT)). 

5. The following steps involve decorrelation and resetting. The readout is brought into contact with the receptor 
of known state RL. The readout relaxes to a state representative of the free-energy difference AGp = —AGoff. 
The chemical work done is 


1+exp ( —AGoff/AiT) 1+exp (—AGs/fcT) 


1^5,chem — P {^Gx,x* H" 

+ (1 ~ P ) {AGx,x* + AGoff) ( 1+exp (- AGoff/feT) ~ 1+exp (AG^/fcT). 


(G3) 


6. The buffer is slowly changed from AGp = — AGoff to AGp = AG^. The readout is restored to a state dominated 


by X. The chemical work done is 

_ rAGs dAGp {^Gp — AGp/p*)ejcp{ — AGp/kT) 1.7^ Ip, f l+®^p(~AGoff/fcT) \ 

'd6,chem — J_AGoff kT ( 1 +exp (-AGp/fcr))^ — m ^ l+exp(AGGfcT) J 

I AGa exp(AG>/feT) . AGoff exp(-AGoff/fcT) i A/ 1 1 

l+exp(AGa/fcT) l+exp(-AGoff/feT) i+gxp (AGa/feT) 


(G4) 


1+exp (AGoff/fcT) ^ 

7. The readout is removed from the receptor. No reactions take place. The system is restored to the initial state. 
Summing over all terms gives 


, 1 + exp(-AGoff/fcT) ^ ^ AGsexp(AG^/fcr) 
1 + exp(AGs/kT) ) 1 + exp(AGs/fcr) 


AGoff 


(1 -p') +p' exp{-AGs/kT) 


1 + exp{—AGs/kT) 

Utilising exp(x) = (1 + exp(a:))/(l + exp(—a;)) once more. The Eq. G5 can be re-written 

i-AGs/kT) 


'^chem — 


/ 1 -k exp(AGoff//cT) \ AGs exp(-AG^/feT) ^ p' + {I - p') exp( 
U + exp(-AG,/fcr); 1 + exp(-AG,/fcT) l+exp(-A( 


-AGs/kT) 


(G5) 


(G6) 
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which is identical to Supplementary Eq. F7 The above method therefore has the same outcome and thermodynamic 
cost as those analysed in Supplementary Discussions 3 and 6 It has the advantage of requiring the manipulation of 
AGp only, but the disadvantage that the reset step is actually necessary. The reset step is needed because the R state 
does not actively set the the memory to a;; we rely on the memory being prepared in the x state prior to copy. 

I 




